Telescoping Jets: Multiple Event Interpretations with Multiple Ws 
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Jets at high energy colliders are complicated objects to identify. Even if jets are widely separated, 
there is no reason for jets to have the same size. A single reconstruction, or interpretation, of each 
event can only extract a limited amount of information. Motivated by the recently proposed Qjet 
algorithms, which give multiple interpretations for each event using nondeterministic jet clustering, 
we propose a simple, fast and powerful method to give multiple event interpretations by varying the 
parameter R in the jet definition. With multiple interpretations we can redefine the weight of each 
event in a counting experiment to be the fraction of interpretations passing the experimental cuts, 
instead of or 1 in a conventional analysis. We show that the statistical power of an analysis can be 
dramatically increased. In particular, we can have a 46% improvement in the statistical significance 
for the Higgs search with an associated Z boson (ZH — > vvbb) at the 8 TeV LHC. 



Jets are manifestations of the underlying colored par- 
tons in hard scattering processes. In order to reconstruct 
hard processes and uncover physics at high energy, jets 
are key objects to identify in high energy collider exper- 
iments. The conventional way to identify jets is to use 
clustering algorithms |l|— 15|] , where a parameter R sets an 
artificial jet size. The constituents of each reconstructed 
jet are those particles within an angular scale R away 
from the jet direction. This is particularly true for the 
anti-fey algorithm because it gives almost perfect cone 
jets in the calorimeter pseudorapidity-azimuthal angle 
(r)-(j)) plane. On the other hand, a jet is a distinct struc- 
ture in its own right with many collinear particles. The 
width of the localized energy distribution of the jet in 
the r\-(f) plane is an independent quantity and should be 
distinguished from the parameter R (FIG.HJ. 

Because the formation of jets is quantum mechanical 
and probabilistic, the widths of jets are always different 
(FIG. [2]). To reconstruct partonic kinematics we should 
pick a large enough R so that most of the radiation emit- 
ted by the partons is enclosed. However, with a large R 
more radiation contamination will be included. We can 
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FIG. 1: A cartoon calorimeter plot distinguishing the width 
of the localized energy distribution of a jet (red) from the pa- 
rameter R (blue) in the anti-kr algorithm. R is an artificial 
distance scale introduced to define the calorimeter region we 
want to look at. The jet axis points in the direction of the 
dominant energy flow, and the precise direction is not essen- 
tial here. 



manage to use jet grooming techniques J7H1G] to get rid of 
contamination. Algorithms with a large R may also fail 
to resolve jets in some events. Multiple partons may be 
in a fat jet which potentially has substructure. Without 
looking into jet substructure we may incorrectly include 
irrelevant jets in event reconstruction. In the end an R 



is chosen for all events to optimize an analysis (see 11 1 
for jets with variable R). A fixed R defines a single set 
of constituents for each jet and a single interpretation for 
each event. There is no choice of R in conventional clus- 
tering algorithms which can resolve jets and get most of 
the relevant radiation for all events. 
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FIG. 2: Two b jets with the same partonic kinematics but 
different widths, wider (top) and narrower (bottom). 
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FIG. 3: The invariant mass distribution of the two b jets for 
a ZH event with multiple interpretations using the telescop- 
ing jet algorithms (black). Using the anti-fcr algorithm with 
-R=0.7, 771^ = 143.4 GeV (red) which is outside the mass win- 
dow of 110 GeV < rrijj < 140 GeV in a conventional analysis. 
Using multiple interpretations reveals the ambiguity of this 
event and 37% of the interpretations pass the cuts (blue). 



Multiple event interpretations can provide extra infor- 
mation and help increase the statistical power of an anal- 
ysis. The recently proposed Qjet algorithms [l2j give 
multiple event interpretations using nondcterministic jet 
clustering. Unlike conventional clustering algorithms, 
Qjets merge pairs of particles probabilistically according 
to an exponential weight, resulting in different clustering 
histories. An event may have a wide range of interpre- 
tations, and the probabilistic nature of Qjets allows the 
correct event structure to emerge. It was shown that jet 
sampling with Qjets [l3( can help improve considerably in 
the statistical significance S/5B -the expected size of the 
signal divided by the background uncertainty- in many 
classes of analyses, and it is interesting to understand the 
essence of Qjets. 

In this paper we propose a simple way to define an 
event interpretation: each choice of R in jet algorithms 
gives a distinct event interpretation. The idea of probing 
jets with multiple i?'s is referred to as telescoping jets. 
As a first step we can apply conventional clustering al- 
gorithms on each event multiple times with different _R's. 
Note that, with a too-small R we may resolve an event 
in too much detail that miss its overall jet structure: in 
the R — > limit particles are all jets. On the other hand, 
with a too-large R we may fail to resolve close jets. To 
deal with these issues, we improve the algorithm by first 
using the anti-fc^ algorithm with a suitable R to reliably 
reveal the jet structure of an event and determine the jet 
axes from the reconstructed jet " cores" . These axes point 
in the directions of the dominant energy flow in an event, 
and the precise directions are not essential. We can also 
use the axes determined through a jet shape minimiza- 
tion procedure and bypass using clustering algorithms. 
Then we define jet constituents by the particles within 
a distance R away from the predetermined jet axes in 
the n-tfi plane. So different interpretations correspond to 
different jet constituents without the tree structure. 
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FIG. 4: The signal (top) and background (bottom) rrijj dis- 
tributions reconstructed using the anti-fcr algorithm with 
R=0.7 (red), as well as the telescoping anti-fcr (blue) and 
cone (green) algorithms. Using multiple event interpretations 
gives a wider signal Higgs mass peak, but it reduces the sta- 
tistical fluctuations of the rrijj distributions. 



However, another way of thinking about the above tele- 
scoping cone algorithm is that, we essentially move down 
the clustering sequence in the anti-fcr 1 algorithm to build 
up jets after identifying the branch structure. This is 
complimentary to moving up the reclustered tree and 
looking for mass drops to identify the branches [g, |7j]. 
Using different i?'s allows us to probe the energy distri- 
bution within each jet and give multiple event interpre- 
tations, and every observable of each event turns from a 
single number to a distribution (FIG.|3J). 

In the following we present the detailed procedure of 
the algorithm and apply it in a search for associated pro- 
duction of a Higgs and a Z where the Higgs decays to 
two fa jets and the Z decays to vv (ZH —> vvbh). The 
background is Z + bb from g — > bb. We require the events 
to pass a t^T > 120 GeV cut for the experimentally avail- 
able triggers. The bb system is slightly boosted so that 
the two fa jets are closer to each other and more difficult 
to resolve. We define the signal window (specified later) 
by imposing cuts on the invariant mass of the two b jets 
rrijj (FIG. 13} and the transverse momentum of each fa- 
jet in our analysis. With multiple interpretations, each 
event is counted by the fraction of interpretations pass- 
ing the cuts, instead of or 1 in a conventional analysis. 
As we will see, this increases the statistical stability of 
observables so that background fluctuations shrink con- 
siderably, which is the key for S/SB improvement. 
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FIG. 5: The signal and background z distributions ps(z) and 
Pb{z) using the telescoping anti-fcr and cone algorithms, z is 
the fraction of interpretations of an event passing the experi- 
mental cuts. A large fraction of both signal and background 
events can be interpreted differently. 



In the context of Higgs search in the ZH — > vvbb chan- 
nel, we first use the telescoping anti-fey algorithm to re- 
construct the two hardest jets with N different Rs, giving 
N interpretations for each event. The scaled-up compu- 
tation time is tiny compared to using nondeterministic 
clustering algorithms [13(. Here we take 7V=100. The 



value of R ranges from 0.2 to 1.5 with an increment -M?. 
The range of R is chosen because with the £•? > 120 
GeV cut the angular separation between the two b jets 
will be roughly < 2. Here each interpretation is weighted 
uniformly for simplicity. 

The telescoping cone algorithm which captures the jet 
structure more correctly goes as follows: 

• Use the anti-fey algorithm with i?=0.4 to recon- 
struct the cores of the two hardest jets and deter- 
mine the jet axes n\ and rii- 

• Define the i-th jet to be the particles within a dis- 
tance R away from m in the rj-cf) plane: 



JetR= {p | (Vp-VnJ 2 + 



^) 2 <R 2 }. (1) 



• In the case of overlapping jets, assign particles to 
the jet with the closer jet axis. This step is to avoid 
ambiguity and is not crucial when reconstructing 
the invariant mass of the two hardest jets rrijj. 

Here we use the same R for both b jets in an event. 
However, for generic beyond the standard model physics 
searches with both quark and gluon jets in the final state, 
one can exploit the full idea of using different R's for 
different jets. Wc will leave these for future studies. 

Our signal and background events were generated at 
the parton level usin g M adgraph 5 |l5| and then show- 
ered with Pythia 6.4 [16J for the 8 TeV LHC. Wc impose 
the $ T > 120 GeV cut at the Madgraph level and the 
following cuts in the analysis to define the signal window: 



FIG. 6: The signal (blue) and background (red) volatility 
distributions using the telescoping cone algorithm. 
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• Both ptS of the two hardest jets > 25 GeV. 

We use the anti-fey algorithm implemented in Fastjet 
v3.0.0 pjTj, [l8(, and we perform the analysis with R at 
the optimized value of R—0.7. Wc then study how the 
statistical significance of the Higgs search changes using 
multiple event interpretations. With N event interpre- 
tations rrijj turns from a single number to a distribution 
for each event. We define z to be the fraction of event in- 
terpretations passing the above cuts. FIG. [5] shows the z 
distributions ps(z) and Pb{z) for signal and background. 
This is in contrast to the conventional analysis in which 
an event either passes the cuts or does not. With mul- 
tiple event interpretations we can gain more information 
about the degree of certainty of an event being signal-like. 
Weighting each event by z in the counting experiment 
helps improve the significance of the analysis. 

Let e and a 1 be the mean and variance of the z distri- 
bution, and Ns and Nb be the expected numbers of sig- 
nal and background events produced at the 8 TcV LHC. 
Then the significance is equal to 
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(2) 



A more detailed discussion about statistics can be found 
in [H 0]. The volatility (FIG. EJ of each event is 
defined by V—T/(m), where T and (m) are the stan- 
dard deviation and mean of the rrijj distribution of each 
event with multiple interpretations. Volatility is useful 
in distinguishing boosted W jets from their QCD back- 
ground [12J, and we will leave exploiting volatility in 
Higgs searches for future studies. 

The performances of the algorithms are summarized 
in TABLE HI The key for the S/SB improvement is the 
shrink of background fluctuations, which comes from the 
rapid decrease of ob ■ For experimental studies with jet 
energy calibration depending on the parameter R, we try 
different ranges of i?'s and fewer interpretations using the 
telescoping cone algorithm. Note that we can get half 



R range 


TV 


algorithm 


weight 


S/SBt 


0.4 and 1.0 


2 


cone 


z 


14% 


0.4 to 1.0 


7 


cone 


z 


20% 


0.4 to 1.5 


12 


cone 


z 


26% 


0.2 to 1.5 


100 


anti-fcr 


z 


20% 


0.2 to 1.5 


100 


cone 


z 


28% 


0.4 to 1.5 


12 


cone 


Ps/pB 


38% 


0.2 to 1.5 


100 


cone 


Ps/pB 


46% 



TABLE I: S/8B improvements using telescoping jets with dif- 
ferent ranges of R, numbers of interpretations N, jet algo- 
rithms and weights in the counting experiment. 



the improvement by using just two i?'s, and using 12 R'a 
between 0.4 and 1.5 performs almost as good as using 
100 R's between 0.2 and 1.5. 

With ps(z) and pb{z) we can get an even larger im- 

13[ in the 



provement with the optimized weight ps v\ 
counting experiment. Then the significance is equal to 




(3) 



and we get a 46% improvement compared to the conven- 
tional analysis. For i?=0.4 to 1.5 with increment 0.1 we 
can get a 38% improvement with just 12 R's. 

To conclude, the width of the localized energy distri- 
bution of a jet may not match well with the parameter R 
in jet algorithms. The situation is even more complicated 
for events with close jets because resolving jets becomes 
an issue when the parameter R and the distance between 
jets confront with each other. We explore a simple and 
promising way of giving multiple interpretations for each 
event by changing the parameter R in jet algorithms. 
The approach increases the statistical stabilities of ob- 
servables which leads to remarkable improvement in the 
significance of a refined counting experiment. Telescop- 
ing jets open up the possibility of refining and improving 
jet physics analysis in high energy experiments. 

Also, wc only look at the transverse momenta and in- 
variant mass of the two b jets, which arc obscrvablcs 
at high energy scales. It would be interesting to see 
how much more we can improve the significance of Higgs 
searches in hadronic channels by combining the anal ysis 
with other jet substructure 19h21] and color flow [22l.l23j 
observables, which probe softer sectors of QCD and color 
connections in an event. The approach of using multi- 
ple event interpretations could potentially be combined 
with likelihood ratio test and multivariate analysis, and 
in the presence of pile up our method will have to combine 
with jet grooming techniques. Applications of telescop- 
ing jets beyond physics searches, for example observable 
measurements, are also worth investigating. Probing jets 
with multiple Rs may also allow us to construct jet ob- 
servables more reliably. 
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